Robust Dynamic Locomotion via Reinforcement Learning and Novel Whole Body Controller

نویسندگان

Donghyun Kim

Jaemin Lee

Luis Sentis

چکیده

We propose a robust dynamic walking controller consisting of a dynamic locomotion planner, a reinforcement learning process for robustness, and a novel whole-body locomotion controller (WBLC). Previous approaches specify either the position or the timing of steps, however, the proposed locomotion planner simultaneously computes both of these parameters as locomotion outputs. Our locomotion strategy relies on devising a reinforcement learning (RL) approach for robust walking. The learned policy generates multi step walking patterns, and the process is quick enough to be suitable for real-time controls. For learning, we devise an RL strategy that uses a phase space planner (PSP) and a linear inverted pendulum model to make the problem tractable and very fast. Then, the learned policy is used to provide goal-based commands to the WBLC, which calculates the torque commands to be executed in full-humanoid robots. The WBLC combines multiple prioritized tasks and calculates the associated reaction forces based on practical inequality constraints. The novel formulation includes efficient calculation of the time derivatives of various Jacobians. This provides highfidelity dynamic control of fast motions. More specifically, we compute the time derivative of the Jacobian for various tasks and the Jacobian of the centroidal momentum task by utilizing Lie group operators and operational space dynamics respectively. The integration of RL-PSP and the WBLC provides highly robust, versatile, and practical locomotion including steering while walking and handling push disturbances of up to 520 N during an interval of 0.1 sec. Theoretical and numerical results are tested through a 3D physics-based simulation of the humanoid robot Valkyrie.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Reinforcement Learning Based PID Control of Wind Energy Conversion Systems

In this paper an adaptive PID controller for Wind Energy Conversion Systems (WECS) has been developed. Theadaptation technique applied to this controller is based on Reinforcement Learning (RL) theory. Nonlinearcharacteristics of wind variations as plant input, wind turbine structure and generator operational behaviordemand for high quality adaptive controller to ensure both robust stability an...

متن کامل

Learning to Acquire Whole-Body Humanoid Center of Mass Movements to Achieve Dynamic Tasks

This paper presents a novel approach for acquiring dynamic whole-body movements on humanoid robots focused on learning a control policy for the center of mass (CoM). In our approach, we combine both a model-based CoM controller and a model-free reinforcement learning (RL) method to acquire dynamic whole-body movements in humanoid robots. (i) To cope with high dimensionality, we use a model-base...

متن کامل

Reinforcement learning for quasi-passive dynamic walking of an unstable biped robot

A class of biped locomotion called Passive Dynamic Walking (PDW) has been recognized to be efficient in energy consumption and a key to understand human walking. Although PDW is sensitive to the initial condition and disturbances, studies of Quasi-PDW which incorporates supplemental actuators have been reported to overcome this sensitivity. In this article, we propose a reinforcement learning m...

متن کامل

Robust Reinforcement Learning Control with Static and Dynamic Stability∗

Robust control theory is used to design stable controllers in the presence of uncertainties. This provides powerful closed-loop robustness guarantees, but can result in controllers that are conservative with regard to performance. Here we present an approach to learning a better controller through observing actual controlled behavior. A neural network is placed in parallel with the robust contr...

متن کامل

Reinforcement Learning of Robotic Legged Locomotion

Humans and animals show a remarkable level of proficiency in their ways of locomotion. They exploit the dynamics of the whole body to perform a variety of motions such as jumping and running. Hereby, the elasticity in the muscles and tendons carries a key role in enabling robust, dynamic and energy efficient locomotion [1]. At the Autonomous Systems Lab, we have developed the robotic leg ScarlE...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

CoRR

دوره abs/1708.02205 شماره

صفحات -

تاریخ انتشار 2017

Robust Dynamic Locomotion via Reinforcement Learning and Novel Whole Body Controller

نویسندگان

چکیده

منابع مشابه

Reinforcement Learning Based PID Control of Wind Energy Conversion Systems

Learning to Acquire Whole-Body Humanoid Center of Mass Movements to Achieve Dynamic Tasks

Reinforcement learning for quasi-passive dynamic walking of an unstable biped robot

Robust Reinforcement Learning Control with Static and Dynamic Stability∗

Reinforcement Learning of Robotic Legged Locomotion

عنوان ژورنال:

اشتراک گذاری